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Qh 1 Abstract 

The search for the Higgs boson and for physics beyond the Standard 

<N . Model are the major motivations behind the LHC experiment. In 

many scenarios the success of the experiment depends on the knowledge 

r"| , of signal and background event rates at least at one-loop precision. 

Q-T We present the approach of the GOLEM collaboration to build a highly 

automated framework for the calculation of matrix elements at the 

' one-loop level, which is based on the evaluation of Feynman diagrams. 

Part of this effort is an open-source library for the numerical evaluation 

of tensor integrals. As an application, some results for the process 

£Sj ■ PP — > bbbb calculated with this method are presented. 
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The Standard Model of Particle Physics has been tested by previous collider 
^ ■ experiments to a very high precision [Tj. Despite its great success, many fun- 

damental questions such as the nature of dark matter cannot be addressed 
within the Standard Model. Two main goals of the next collider exper- 
iment, the LHC, are therefore the discovery (or exclusion) of a Standard 
Model Higgs boson and the measurement of any new particles accessible 
by the collider energy i2] . Due to the purely hadronic initial state of the 
collisions one expects the interactions mainly to be governed by QCD. A 
precise understanding of both the signal and the background will be crucial 
for most Higgs discovery channels and for the discrimination of different 
scenarios beyond the Standard Model. For many processes a Leading Order 
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(LO) approximation will not suffice and has to be amended by higher order 
corrections. 

The calculation of matrix elements at LO has become an automated 
routine for which many computer programs are available [U El [H [9] . 
Next to Leading Order (NLO) calculations, however, have never reached 
this level of automation. Especially in the case of many particle final states 
(> 3 final state particle) automatisation is not straightforward as one eas- 
ily hits the limits of current computer technology. A full NLO calculation 
in QCD consists of a 2 — > n particle tree-level contribution (LO), the real 
emission of an extra parton (2 — > n + 1, tree-level) and virtual corrections 
(2 — * n, one-loop diagrams). Both real and virtual corrections can contain 
infrared divergences which only cancel in the sum of both contributions. 
These divergences can be dealt with by subtraction methods [1Q|. [TT] which 
have also become available as automated implementations [12 } 113 ] [TH I15j . 
The only missing ingredient for a full automation of NLO calculations are 
the virtual corrections. Although many different methods have been pro- 
posed [HI HZl [El [201 EH [221 [231 El] no fully automated implementation has 
been made available yet. The very limited number of results found in the 
literature for processes with four final state particles [23 GU [27J EHl (23 [30] 
underlines the importance of automatisation in the context of one-loop cal- 
culations. 

The GOLEM collaboration focuses on the development of such an automa- 
tised tool for one-loop matrix element calculations^. We have applied the 
GOLEM method to calculate the QCD one-loop corrections of the process 
uu — > bbbb, which is a subprocess of pp — > bbbb. This process is a particu- 
lar important background in MSSM Higgs searches at large values of tan j3, 
where one of the Higgs bosons decays predominantly into bb pairs. 



2 The GOLEM Approach 

Our approach is based on the calculation of Feynman diagrams. We generate 
the diagrams and their corresponding algebraic expressions using QGraf [31] 
and project them on a colour and helicity basis. The integration over the 
momentum of the virtual particle introduces tensor integrals of the form 



Tn;iMi,...,nr 
1 N 



These integrals are reduced to a basis of scalar integrals by the method 
described in [321 133] : we use two independent implementations where 

a) the tensor reduction is carried out on a symbolical level and the am- 
plitude is represented in terms of Mandelstam variables, Levi-Civita 
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tensors and the commonly used standard basis of scalar one-loop in- 
tegrals 

■M{A},c = Cbox^4 + Ctri-^3 + Cbub-^2 + C ta dll ■ 

The algebraic reduction and further simplifications are achieved using 
Form [34] and Maple. 

b) a form factor representation is introduced for the tensor integrals and 
the tensor reduction is delayed until the numerical evaluation. The 
form factors are implemented in the Fortran library golem95 which is 
described in |35j. The amplitude is represented in terms of Mandel- 
stam variables, spinor bi-products u(pi)(l ±^§)u{pj) and an extended 
basis of one-loop integrals that allows for Feynman parameters in the 
numerator. 

In both strategies, Gram determinants in the denominator are either avoided 
or to some extent cancelled explicitly on the symbolical level, thus resulting 
in a numerically stable implementation of the matrix element. 

The results presented below have been obtained with implementation b) 
in which the user starts from a very minimal process description. All neces- 
sary files are generated by a Python script. The matrix element is obtained 
as a set of Fortran90 files which are compiled on the target system, in our 
case the ECDF cluster [36] . 

The direct integration of a one-loop matrix element over phase space 
with an adaptive Monte Carlo (MC) has two major disadvantages. The 
evaluation time of an NLO matrix element is considerable larger than that of 
a LO matrix element. Hence one should try to avoid unnecessary calls of the 
NLO matrix elements. The second problem are phase space regions where 
the chosen integral basis becomes linearly dependent and lead to numerical 
fluctuations. When these fluctuations reach the order of magnitude of the 
precision goal of the adaptive MC, the MC program tends to overestimate 
these phase space regions which can lead to numerical instabilities. 

We avoid these problems by performing the phase space integration as a 
reweighting of unweighted LO Monte Carlo events. We use WHIZARD [3] an 
adaptive Monte Carlo integrator to obtain a list of unweighted LO events 
{p}i such that an observable O can be written as 

r N 
(0) LO = J d$({p})\M W fO({p}) = Jim ^£°(M0 (2) 

i=l 

where the limit is understood in the statistical sense as a limit on the variance 
of the MC sum. The observable at one-loop precision is obtained as 

JV 

(0) onc _ loop = ton ^ £ K({p})0({p}i) (3) 
jv— >oo iv . — ; 
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with the local i^-factor 



K({p}) = • M LO-(-MLO+.Mvirt+I-.MLo) (4) 

The matrix elements M are understood as vectors in a given colour basis 
and I is the insertion operator as defined in [10J, which ensures that after 
UV-renormalisation all poles in l/(n — 4) cancel. The reweighting can be 
understood as importance sampling with the probability density 

1 d<r L o(M) , K s 

w({p\) oc . 5) 

(TLO d$({p}) 

It should be emphasized that the definition of the local i'T-factor in 
Eq. does not contain any real emission contributions and therefore the 
results below lack any physical interpretation. 



3 Results for uu — > bbbb 



The results for the virtual correction of the process uu — > bbbb have been ob- 
tained with rif = 5 massless quark flavours and for pr > 30 GeV, a rapidity 
cut of rj < 2.5 GeV and a separation cut of AR = ^(A^) 2 + (A??) 2 > 0.4. 
The centre of mass energy is 14TeV. In this data set we choose the scales 
as up = = J2i=iPT,i/^ and fold with the CTEQ6.5 parton distribution 
functions [37] . We work with the modifications to the dipoles and insertion 
operators as proposed in [38]; for the distributions below we have evalu- 
ated the insertion operator at on = 0.1. For this specific amplitude this 
modification induces a shift of the insertion operator of 
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On the ECDF cluster our code achieved a performance of 8.9 s (17.6 s) per 
phase space point and nodeH in double (quadruple) precision. The above 
timings suggest that an evaluation of the whole integration in quadruple 
precision is too costly to be practical and should be the last resort if double 
precision is not sufficient. The runtime will be improved further through 
code optimisations in the next version of the GOLEM code. 

Figure [T] shows the distribution of the values for the local iT-factor, the 
single pole and the double pole of an amplitude for 200,000 randomly chosen 
points in both double and quadruple precision. The distribution of the the 
-ftT-factor shows that the values are sharply peaked around the the integrated 
result of 0(1) and in double precision a small fraction of less than a percent 
of the points stretches out to atypically large values. It is clear that in a 
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data set of order 10 6 MC events a single outlier of that magnitude is already 
enough to tamper with the result. On the other hand, since the number of 
points in doubt is very small, an a posteriori test is enough and it suffices to 
re-evaluate those points at a higher precision. As possible test criteria we 
studied a cut on the K-factor, on the coefficient of the single pole and on 
the double pole; if the magnitude of the double precision result exceeds the 
cut the point is evaluated at higher precision. 
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Figure 1: Comparison between evaluations of the matrix element for 200,000 
MC events in double precision and quadruple precision. Shown are the 
distributions of the values of the local K-factor as defined in Eq. (jlj) (left) 
and the single l/(n — 4) (middle) and double l/(n — 4) 2 (right) pole of the 
matrix element. 

We have taken the same sample of 200,000 points to study the influence 
of the cut parameter on the integrated result. Figure [2] shows the relative 
error e re i = [o-(SP cut ) — cr(0)|/cr(0) on the cross-section versus the cut pa- 
rameter SP cu t- The steep increase of the error indicates an outlier in the 
-RT-factor that is not reflected in the cancellation of the pole. A very simi- 
lar picture emerges for a cut on the double pole (not shown). This lack of 
correlation between the pole cancellation and error on the integral can be 
circumvented by imposing a test on the local K- factor as shown in Figure [3l 
In the region between 2 < K cut < 5 a relative error of ~ 0.10% is achieved 
while only 0.5% of the phase space points need to be evaluated at a higher 
precision o 

3 The downwards trend at K cut ~ 10 is a statistical fluctuation. 
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Figure 2: Relative error on the integrated result versus a cut SP cu t on the 
coefficient of the single pole. If the cancellation of the single pole in double 
precision is worse than SP cut the data point is evaluated in quadruple preci- 
sion. The dashed curve indicates the percentage of points that fail the test 
and need re-evaluation (right y-axis). 




6 




Figure 4: Distributions of the transverse momentum (left) and the rapidity 
(right) of the hardest jet. The result for NLO v i r t is obtained from the fi- 
nite contribution of the virtual part of the NLO prediction, as described in 
Eq. (H). 




Figure 5: Stability under simultaneous variation of the renormalisation 
scale (jlr and the factorisation scale \xf around the central value of \i§ = 

EilPT.i/4. 
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4 Conclusion 



A successful interpretation of the LHC data will need precise predictions 
of both background and signal. For many processes this involves the eval- 
uation of one-loop QCD amplitudes with many particles in the final state. 
We have presented the approach of the GOLEM collaboration to automatise 
such calculations that allows to generate code for the matrix element which 
is both fast and numerically stable in all relevant phase space regions. The 
need to control the accuracy of the result has been emphasised and a poste- 
riori precision test has been proposed. We have introduced a new indirect 
integration method based on reweighting unweighted LO events by a local 
-RT-factor, which avoids certain problems that otherwise arise from a direct, 
adaptive MC integration of the one- loop matrix element. Results have been 
presented for the virtual corrections of the process uu — > bbbb, which is part 
of an important background for MSSM Higgs boson searches at the LHC. 

We have shown that our approach allows for efficient implementations 
for NLO predictions of processes with multi-particle final states and that 
the GOLEM project can lead to a fully automated tool for NLO calculations. 
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